A Generalized Topic Modeling Approach for Maven Search
Abstract
This paper addresses the problem of semantics-based maven search in the research community, that is, identifying a person with a given expertise. Traditional approaches ignored either semantic knowledge or temporal information, so some relevant mavens cannot be identified effectively, because the keywords do not occur and time effects are not exploited. We propose a novel semantics and temporal information based maven search (STMS) approach that discovers latent topics (semantically related soft clusters of words) among authors, venues (conferences or journals) and time simultaneously. In the proposed approach, each author in a venue is represented as a probability distribution over topics, and each topic is represented as a probability distribution over words and over the year of the venue for that topic. Through the discovered latent topics, we can search for mavens by implicitly modeling word-author, author-author and author-venue correlations with continuous time effects. We explain the inference procedure for topics and authors of new venues. We also show how correlations between authors can be discovered, and how topic sparseness degrades retrieval performance. Experimental results on a corpus downloaded from DBLP show that the proposed approach significantly outperforms the baseline approach, owing to its ability to produce less sparse topics.
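To make the idea of searching mavens through latent topics concrete, the following is a minimal sketch of topic-based author ranking. It is not the STMS model itself: it assumes the author-topic and topic-word distributions have already been estimated, omits the venue and time components, and scores an author by the query likelihood under that author's topic mixture. All names (`theta`, `phi`, `rank_authors`) and all numbers are hypothetical and chosen for illustration only.

```python
import math

# Hypothetical toy parameters, NOT taken from the paper.
# theta[a][z] = P(topic z | author a); phi[z][w] = P(word w | topic z)
theta = {
    "author_a": [0.9, 0.1],
    "author_b": [0.2, 0.8],
}
phi = [
    {"topic": 0.5, "model": 0.4, "database": 0.1},
    {"topic": 0.1, "model": 0.1, "database": 0.8},
]


def query_log_likelihood(query_words, author_topics, topic_words, eps=1e-12):
    """Return log P(q | a) = sum over w of log( sum over z of P(w|z) P(z|a) )."""
    total = 0.0
    for word in query_words:
        # Marginalize over topics; words unseen in a topic contribute 0.
        p_word = sum(
            p_topic * topic_words[z].get(word, 0.0)
            for z, p_topic in enumerate(author_topics)
        )
        total += math.log(p_word + eps)  # eps guards against log(0)
    return total


def rank_authors(query_words, theta, phi):
    """Sort authors by descending query log-likelihood."""
    scores = {
        author: query_log_likelihood(query_words, topics, phi)
        for author, topics in theta.items()
    }
    return sorted(scores, key=scores.get, reverse=True)


if __name__ == "__main__":
    print(rank_authors(["topic", "model"], theta, phi))
    print(rank_authors(["database"], theta, phi))
```

Because the score is computed through topics rather than exact keyword matches, an author can rank highly for a query word that never appears in that author's own text, as long as the word is probable under the author's dominant topics.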